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Abstract 

We show that the total variation mixing time of the simple random walk on the giant 
component of supercritical G(n,p) and Q(n, m) is ©(log 2 n). This statement was only recently 
proved, independently, by Fountoulakis and Reed. Our proof follows from a structure result 
for these graphs which is interesting in its own right. We show that these graphs are "dec- 
orated expanders" — an expander glued to graphs whose size has constant expectation and 
exponential tail, and such that each vertex in the expander is glued to no more than a constant 
number of decorations. 

1 Introduction 

The mixing time T of a finite connected graph G, loosely defined as the time a random 
walk on that graph needs in order to be quite close to its stationary distribution, is an 
important concept in randomized algorithms and theoretical probability (see Section |2] for 
precise definitions). It has strong connections to the geometry of the graph — for example, 
c/A < T < Clog(|C7|)/A where A is the spectral gap of the graph and c and C are some uni- 
versal constants. This logarithmic factor is quite important, though. Typically graphs coming 
from applications in algorithms and statistical physics are exponentially large and have large 
spectral gaps, so this factor is crucial for the applicability of an algorithm. Thus extensive 
efforts have gone into understanding this factor better. See e.g. Lovasz and Kannan |24j : 
Morris and Peres [20]; or Goel, Montenegro and Tetali ;18 . 

The mixing time of random graphs in particular is a topic of research. Random d-regular 
graphs were the first examples of expanders (hence they have mixing time log(|c7|)). The first 
author and Mossel [S] considered the mixing time for the simple random walk on the largest 
percolation cluster in a box subset of the (i-dimensional integer lattice. Our purpose here is 
to do the same for the — a priori one might assume simpler — case of the largest cluster of 
random graphs. 

In this article, we consider two standard random graph models: the Erdos-Renyi graph 
G(n,p) in which every edge is taken independently with probability p and p = c/n for some 
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constant c > 1; and Q(n,m), the model where all graphs with n vertices and m edges are 
equally likely, for m ~ cn/2. It is well known that for such c there is asymptotically almost 
surely (a.a.s.) a unique giant component in the random graph. Our main result is the following. 
Here 0(/(n)) denotes a function that, for some positive constants c\ and C2, lies between 
ci/(n) and cif{n) for all n sufficiently large. 

Theorem 1.1 Let c > 1 and m ~ cn/2. The mixing time of the simple random walk on the 
giant component of Q(n,m) is a.a.s. 0(log 2 n). 

It is easy to see that this implies the corresponding result for Q(n,p). 

The lower bound is easy, and one way to see it is this. It is straightforward to show using 
standard techniques that for some c' > and with m as in the theorem, Q(n,m) a.a.s. has a 
path of degree 2 vertices of length at least d log n. On the other hand, the hitting time of one 
end of a path of length k is @(k 2 ). 

In Section [2] we give a new mixing time bound (Theorem I2.3j) for "decorated expanders", 
namely graphs which contain an expander subgraph B (we call B the "strong core") whose 
deletion leaves "small" components such that a bounded number are attached to any vertex 
of B. Then in Section^Jwe show (Theorem 14.2(1 that Q(n,m) is in fact a decorated expander. 
The upper bound in Theorem 11.11 follows immediately from these two results. 

The construction of the strong core is quite involved. We must stress that one cannot just 
take, say, the 3-core. First, it does not exist for all c down to the critical value 1. But even 
when it does exist, it does not satisfy all needed properties. See the definition of an AN-graph 
in Section ^. 21 As part of our argument, we give an explicit result on the expansion of random 
graphs with given degree sequences and all degrees at least 3 (lemma l5.3|) . This generalizes a 
number of known and folk results — the d-regular graph, the fc-core of Q(n,p) and Q(n,m), 
the fe-core of bernoulli percolation on a d-regular graph (see ^lEOj) and of course our strong 
core are all of this form. 

Independently of this work, Fountoulakis and Reed |16( I17j recently obtained the first 
proof of the 0(log 2 n) upper bound on mixing time. They show that the constant implicit 
in the bound is 0(c~ 2 ) for c bounded above by approximately Vhinln Inn. However, their 
computations only relate to Q(n,p), and so do not obviously imply anything for Q(n,m) since 
the mixing time is not a monotonic or even convex function on the lattice of subsets of the 
edge set of the complete graph. It is interesting to compare the two approaches. Fountoulakis 
and Reed start from the Lovasz-Kannan integral. Applied directly it gives Clog 3 n. This 
is necessary since, as Morris and Peres |29| discovered the Lovasz-Kannan integral in fact 
bounds the mixing time in the L°° norm which in our case is really ©(log 3 n). See 29 for a 
detailed discussion of the difference between these two notions. A similar problem exists with 
using the spectral profile ^H] on this problem. Fountoulakis and Reed find a variation of the 
Lovasz-Kannan integral which bounds only the usual, total-variation norm mixing time and 
gives the correct value for the case of Q(n,p). Our approach is more geometric. Once one has 
that G(n, m) is a decorated expander, the mixing time is evaluated easily with the help of the 
Lovasz- Winkler |2(i| theory of equivalences of the mixing time, a theory whose vast potential 
is yet to be exploited. 

Let us close this section with a few remarks 

1. Consider the critical case c = 1. It is well known that in this case the largest cluster is 
of the order of n 2//3 and there are more clusters of comparable size. Further, it is also known 
that the cluster can be split to two pieces, both of size at least era 2 / 3 which are joined by a 
bounded number of edges. Hence one gets that the mixing time T satisfies T > cn 2 ^ 3 , or in 
other words, the walk does not mix rapidly at all. This bound does not seem to be exact, 
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though. To understand why, examine the anomalous diffusion coefficient (5. In analogy with 
the results for random walk on Galton- Watson trees (0, see also jH]) one would expect that 
(3 = 8/3 or in other words that random walk on a critical cluster would exhibit sub-gaussian 

diffusion. Since the diameter of the cluster is n 1//3 , the walk needs (n 1 / 3 )' 3 = re 8 / 9 steps to get 
from one end of the cluster to the other, so a natural conjecture is that T > cre 8//9 . Whether 
or not this is precise, we do not know. 

2. Here is an easy corollary of our structural results, for the diameter of the giant com- 
ponent. The proof assumes familiarity with the later parts of the paper, but should be 
understandable at this point. 

Theorem 1.2 Let c > 1 and m ~ cn/2, and e > 0. The diameter of the giant component of 
Q(n,m) is a.a.s. B(logn). 

Proof The lower bound follows easily from the property that the giant component contains 
induced paths of length at least e log n a.a.s. The upper bound follows in two steps. Firstly, the 
a-strong core B, that exists a.a.s. in the giant component by Theorem 14.21 is an a-expander 
and consequently has diameter O(logn). Secondly, the attachments to it have size, and hence 
height, at most O(logn) a.a.s. I 

We note that sharper results on the diameter of these graphs have only recently been 
obtained by Fernholz and Ramachandran |15j and Bollobas, Janson and Riordan jllj . 

3. It would be interesting to extend the present result to the study of percolation on 
random regular graphs. It was shown in [1] that if the edges of a random cf-regular graph 
are deleted independently with probability 1 — p each, the threshold of appearance of a giant 
component is at p = l/(d — 1). We conjecture that the mixing time of the random walk on 
the giant component for fixed p > l/(d — 1) is again 0(log 2 n). This seems related to the 
analogous question for random graphs with a given degree sequence, which would also be of 
interest. 

2 Mixing times 
2.1 Definitions 

There are many possible definition of a "mixing time", and extensive literature devoted to 
proving relationships between the various definitions. We shall briefly sketch the terms we 
shall need, and refer the reader to |2£] for a more orderly and far more exhaustive introduction. 

Definition. Let Tq C T\ C • • • be a series of u-fields on a space Q. A stopping rule (for 
J- n ) r is a stopping time with possible external randomization, namely, there exists some ^2 
such that r : f2 x ^ an d such that for every u> G O2, T(-,uj) is a stopping time for J- n . 

Let G be a connected finite graph, let a be a distribution on the vertices of G, and let T 
be a stopping rule for a random walk R on G whose starting point R(0) is distributed like a. 
Denote 

al = F(R(T)=v). 

In other words <r r is the distribution of the location of R at time V when starting from a. We 
also say that T is a "stopping rule from a to <r r ". 
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Definition. Let G be a connected graph, and let a and r be two distributions on the vertices 
of G. Define the access time from a to r, denoted by TL(<j,t), using 

H((t,t) := min EI\ 
r :( r r =T 



The set of stopping rules from a to r is never empty: for example, it always contains the 
naive rule, namely, initially choose a vertex v using r, then walk until the first time v is hit. 
Naturally, in most cases this rule is not optimal. 

Definition. Let G be a connected, graph. The mixing time of G is defined as 

TL := max W(<7, n) 

u 

where the maximum is taken over all distributions a, and where tt is the stationary distribution 
of G. 

Note that we define the stationary distribution as the limit 

1 1 

tt v := lim ~J^F(R(k) = v) 

t^oo t — ' 
k=l 

and hence we do not need to assume that G is aperiodic. Recall that tt v is proportional to the 
degree of v, d v namely 

where E(G) is the number of edges of G. 

It is interesting to compare TL with more natural notions of the mixing time. For example, 
is it true that after t = \CTL~] moves of a random walk, that the distribution of R(t) is close 
to it in some norm? Generally the answer is no. For example, if G is a complete bipartite 
graph of size 2n with one edge added (so that G would be aperiodic). Then TL < C, but it 
takes approximately n 2 steps until the walk becomes mixed in the naive sense, since it needs 
a reasonable probability to traverse the only edge which makes G aperiodic. In this particular 
case, however, it is still possible to get a uniform distribution by randomly picking a fixed 
length, for example 1 or 2 with probability i, so for practical purposes, namely for an efficient 
algorithm to pick an approximately random point, it is quite reasonable to claim that the 
mixing time of the graph G is | . It turns out that this example is typical, in the sense that 
by picking the length of the walk randomly, independently of the actual steps taken, we get 
close to 7r by moves. For example one might take the length uniform between 1 and 

\cn\. 

Definition. Let G be a connected graph. The approximate uniform mixing time of G 

is defined by 

U e := mm | \a T — ir\ \ < e 

where T is a stopping time with probability 1/t for every time between and t—1 independently 
of the walk; where 1 1 • 1 1 stands for the L 1 norm (a.k.a. the total variation norm), i.e. | — r| | := 
\^i v ) ~ T ( v )\i an d where tt is the stationary distribution of G and e > is some parameter. 

There are other variation on this "random number of steps" theme. See e.g. |IJ for a 
continuous time random walk version, |25| theorem 7.2] for another version, and open problem 
17 in chapter 4 of [2]. 
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Definition. Let G be a connected, graph. The approximate forget time of G is defined 
by 

J- e : = minmax min 7i(a,fi). 

r a /t :||^-r||<e 

where e > is some parameter, and where || • || stands for the L 1 norm, i.e. \\fi — r|| := 
£>(«)- T(t;)|. 

T e is called a "forget time" because we consider stopping at r to be "forgetting" our initial 
distribution a. Perhaps surprisingly, the minimum r is not necessarily achieved at tt, and for 
directed graphs the ratio Tl/Fo may be arbitrarily large. See J2S] for a detailed discussion. 

Theorem 2.1 For any e < \, J- e = Q(7i) = Q(U e ) where the constants implicit in both G 
may depend on e. 

The inequality T e = ®(7~t) follows from theorems 3.1, 3.2 and 3.8 in [^H] — note that a random 
walk on a (non-directed) graph is always a time-reversible Markov chain. The inequality 
U e = Q(TC) comes from corollary 5.4 ibid. 

Definition. The (edgewise) Cheeger constant of a connected graph G is defined by 

$ := min —— V ir iPij 
o <7r( s)<i n(S) . J-^ 

where pij is the probability to step from i to j, namely 1/di if j is a neighbour of i and 
otherwise; where tt is the stationary distribution of G, and where tt(S) := ^igs 71 "*- 

Note that plugging in (|2.1|) we see that the element inside the min is, more or less the 
quotient of the number of edges leading out of S divided by the number of edges inside S, 
hence the name "edgewise" Cheeger constant. A similar value is called "conductance" in 

HH2H. 

We shall need the following connection between the Cheeger constant and the mixing time. 
Theorem 2.2 Let G be a connected aperiodic graph. Then 

H < Clog(l/min7riW. 

This was first proved by Jerrum and Sinclair in 22 a (the particular case of expanders, which 
is what we will use, was proved earlier by Alon [S], and in the continuous setting this goes 
back to Cheeger j!3j). We shall only use the form 

H<ClogE(G)-^ (2.2) 
which follows immediately from ()2.1|) . 



2.2 AN- graphs. 

Definition. We say that a connected graph 
expander) where a > is some number, if the 
properties: 

1. B is a a-expander, i.e. <&(B) > a. 



G is an a-AN graph, (or an a-decorated 
graph has a subgraph B with the following 
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2. 



The connected components Di of G\B are small in the following sense: denote by E'(Dj) 
the number of edges in G with at least one vertex in Di, or in other words, the internal 
edges of Di added to the edges connecting Di to B. Then 



#{i : E'(Di) > A} < E{G)e 



—Xa 



(2.3) 



In particular there are no components with E'{Di) > ^ log E(G). 
3. Each v £ B is connected to no more than — different Di-s. 

G \ B here denotes the graph reached after removing the vertices of B and all edges with 
at least one vertex in B from G. Note that the definition is meaningless for a > 1, so we will 
always assume a < 1. 

Theorem 2.3 The mixing time of an a- AN graph is < Ca~ 6 log 2 E{G). 

Note that the Cheeger constant of G might be < Ca 3 / log E(G). For example, take a subset 
of B realizing the maximum — assume it is small — and hang from each vertex ^ copies 
of straight line segments of length log E(G) /2a. Hence theorem 12.21 only gives a bound of 
Ca~ 6 log 3 E(G). In our application a will be a constant independent of n, so the result of 
the theorem is an asymptotic improvement. This example also shows that the mixing time 
is > ca~ 2 log 2 E(G) since if we start from the end of one such straight line segment we need 
that many steps to have a decent probability to exit it. 

Proof For every v,w £ B, let q(v,w) be the probability that a random walk on G starting 
from v hits B in w (in particular, q(v, w) ^ only if v and w are neighbours in G or if 
v and w have neighbours in the same Di). The symmetry of the random walk on G gives 
d v (G)q(v, w) = d w (G)q(w, v) — we will denote the degree of a vertex by d v (G) when it is not 
clear about which graph we are talking. Now construct a weighted graph with self loops B\ 
with the vertex set identical to the vertex set of B, and for any v,w E B\ make the weight of 
the edge between v and w be d v (G)q(v,w). It is easy to see that B\ is B with some added 
edges and some edges with increased weight, and the total weight of every vertex is increased 
by at most — due to requirement 3 from an a- AN graph. Hence 



We now use the clause 7i = @(U e ) in Theorem l2.H define tti to be the stationary distribution 
of B%; let C2 be some constant sufficiently large and define 



Let Si be the stopping rule stopping at time t = 0, . . . , J — 1 with probability -j. Let o\ be 
any distribution on B\. Then we get, for C2 sufficiently large, 



> a 2 . 



(2.4) 



J = \C 2 H(B 1 )] . 



(2.5) 



a 



f- 7ri|| <- Vcti. 



(2.6) 



Also, Theorem 12.21 allows us to estimate 

12~H 

J< \ClogE(Bi)Q~ 2 (Hi)~\ < Ca~ 4 log EiBi) <Ca~ 4 log E(G). 



(2.7) 



Next define stopping times Tj as follows: tq := —1 and 



Tj := min{t > Tj-i : R(t) G B}. 
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The point about the definition of B\ is that the regular random walk on B\ is identical to the 
process R{Tj). This allows to translate (|2,Bj) to the setting of the random walk on G: define 
S to be the stopping time that stops at tj, j = 1, . . . , J with probability j. Then for any 
starting distribution a on G, 

ll^-^H <-. 

Reaching the distribution tt\ is in effect forgetting <r, hence we may use the clause TL = 0(J r e ) 
of Theorem 12.11 to get that 

H < CmaxES < CmaxErj. 

(7 V 

Hence we have the task of estimating Etj. In general, if D is any graph and v G D; and if 
R is a random walk starting from v, then the expected time until R returns to v is l/ir(D) v . 
See chapter 2, lemma 5]. Take some j > and let D C G be 

D := \^J{Di ■ Di neighbours R(rj)} U {v £ B : 3i s.t. Di neighbours R(tj) and v}. 

The definition of an a-AN graph shows that there are < a -1 D^-s neighbouring R(tj) and 
each one satisfies E'(Di) < a -1 log so E(D) < a~ 2 \ogE(G), and then 

ED 

E^-t^IM^.) < 2E(D) <2a- 2 log E(G). (2.8) 

This estimate does not work for t\ since R(0) might not belong to B. Here we need the fact 
that for any graph D, the expected time that R takes from v to w is < 2E(D) ■ p where p is 
the electrical resistance between v and w. See [21 chapter 3, corollary 11]. Clearly p < \D\ 
since the resistance between v and w is at most the resistance of a path betwen them. Let 
therefore Di be the component containing R(0) and let D be Di U {its neighbours in B}. We 
get 

En < E{Df = E'{Dif < a' 2 log 2 E(G). (2.9) 
Collecting (|2?7|) . (I2~H1) . and (ESJ) we get that 

Erj < Ca" 6 log 2 S(G) 

which finishes the proof. I 

3 Random graph preliminaries 

We define an a-strong core of a graph G to be any subgraph B of with the properties as listed 
in the definition of the a-AN graph in Section 12.21 We seek an a-strong core of G G G(n,p) 
for p = c/n where c > 1 is fixed. It suffices (and indeed gives a stronger result) to consider 
G £ G(n,m) for m ~ cn/2. (See Bollobas [H] or Janson et al. [2^ for these basic definitions 
and results on random graphs). The 2-core of a graph or multigraph is the maximum subgraph 
of minimum degree at least 2. The 2-core, if it exists, is known to be unique; otherwise we 
say the 2-core is empty. It can be obtained by recursively deleting vertices of degree and 1. 
Define 

b = b(c) := 1 - t/c, 
where t = t(c) is the unique root of the equation 

ter t = ce~ c , te(0, 1). 
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The following results are well known. (See j^J for example, or |3L)| for more precise results 
on the joint distributions and related information. See [31] for definition of the combination 
of the notations o and a.a.s.) 

Theorem 3.1 The number of vertices of the giant component of G £ Q(n,p) is a.a.s. bn + 
o(n), the number of vertices of the 2-core of G is a.a.s. 6(1 — t)n + o(n), and the number of 
edges of the 2-core is a.a.s. b\n + o(n) where b\ = 6i(c) > 6(1 — t). The same results hold for 
G G G(n, m) with m ~ cn/2. 

Deleting the edges of the 2-core therefore a.a.s. leaves a forest F of 6(1 — t)n + o(n) trees with 
bn-\-o(n) vertices in total. We will condition on the event that the sizes of the giant and 2-core 
satisfy these conditions. Each forest of a given number of trees with a given number of vertices 
is equally likely to occur as F. Also well known is the following type of result. First, we say 
that the distribution of a random variable X has an exponential tail if P(X > j) = 0(e~ CJ ) 
for some c > 0. 

Lemma 3.2 Let g(n) be a fixed function with g{n) = o(n). Let G E Q(n,p), conditional upon 
the giant component having between bn — g(n) and bn + g(n) vertices, and the 2-core having 
between 6(1 — t)n — g{n) and 6(1 — t)n — g(n) vertices. The size of the tree in F containing a 
given vertex of the 2-core (conditional upon that vertex being in the 2-core) has an exponential 
tail. 

Proof We may fix the 2-core with s ~ 6(1 — t)n vertices, and assume the vertices of the giant 
componenet not in the 2-core are labelled 1, ... ,r (where r ~ btn). Then, after deleting all 
edges of the 2-core, each forest of s trees with root vertices in the 2-core (mutually distinguish- 
able from each other but unlabelled), and the non-root vertices labelled 1, . . . ,r, is equally 
likely to occur as F. The number of such forests is s(r + s) r_1 (see 28, p. 17]). The number 
of possible trees rooted at the first 2-core vertex, given the tree has j + 1 vertices (that is, 
j labelled vertices plus the root vertex) is (p(j + l)^' -1 ', where the first factor chooses the 
tree's vertices and the second constructs the tree. Additionally, of course, the remaining part 
of the forest is counted by (s — l)(r —j + s — l) r ~ 3 . Applying Stirling's formula and a little 
manipulation now shows that the probability that the first tree has size j is 

o( s r 3/2 ((i-P)e»)Y) 

where p = s/(s + r). Since c is fixed, (1 — p)e p is less than, and bounded away from, 1. The 
lemma follows. I 

The 2-core will not usually be an a-strong core of the giant component because it has long 
(length c'logn) paths of degree 2 vertices. The a-strong core we will be using is obtained 
from the 2-core by deleting paths of degree 2 vertices. We make this precise as follows. Define 
a 2-path in a graph G to be a path induced by vertices of degree 2 in G, and an isolated cycle 
to be a component of G that is just a cycle. 

Analogous to the definition for a random variable, we say that a set S = {s^} of nonnegative 
numbers has an exponential tail if some C > and C' exist such that for all j > 



\{i-Si>j}\ 
\S\ 



< C'e~ c K 



(Equivalently, we could work with the definition in which C" = 2.) Moreover, if the set S is 
indexed by n and there exist universal constants C and C' for which the inequality is true 
a.a.s. (as n — * oo), we say that S has an exponential tail a.a.s. 
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We obtain the following result quite easily using Markov's inequality and Lemma 13.21 
together with simple sharp concentration of the numbers of trees of a fixed size (say by applying 
Chebyshev's inequality after computing second moments along the lines of the calculation in 
Lemma l3.2j) . The proof is left as an exercise. 

Lemma 3.3 The set of sizes of trees in F a.a.s. has an exponential tail. 

4 Stripping processes 

Let N be any integer. Given a graph, we can perform a "stripping" process that first removes 
all vertices not in the 2-core, G, of the graph and then recursively deletes isolated cycles and 
the vertices in 2-paths of length greater than N, as well as vertices of degree less than 2 (which 
arise if both ends of a 2-path are adjacent to a vertex of degree 3) and vertices with more 
than N removed neighbours, until no more deletions are possible. Note that the resulting 
graph does not depend on the choices made at each step, since, once a vertex can be deleted, 
it remains deletable in any subsequent step. Presumably when G is the 2-core of the giant 
component of a graph in Q(n,m) (with m as in Theorem 13. ip . this resulting graph is, for N 
sufficiently large, an a-strong core a.a.s. However, to make the proof easier we will modify 
this stripping process. 

To analyse such processes we will consider the kernel K(G) of G, defined for G with 
minimum degree at least 2. The kernel is obtained by replacing each maximal 2-path that 
joins two vertices u and v of degree at least 3 by an edge uv, and deleting each isolated cycle. 
It is possible that the kernel possesses loops and/or multiple edges. A loop contributes 2 to 
the degree of its incident vertex, so 5(K(G)) > 3 (6 denoting minimum degree). 

The kernel was used in |2Q] to derive properties of the 2-core of the random G G Q(n,m). 
In particular, it is easy to obtain the following from the results there. Here b is the same as 
defined in Section [3 

Lemma 4.1 The number of vertices of degree 2 in the 2-core of G 6 Q(n,m) is a.a.s. + 
o(n) for a constant 62, depending on c, with < 62 < b(l — t). 

It follows from this and Theorem l3.1l that the size (number of vertices) of the kernel is similarly 
sharply concentrated at (6 — 62)^ and the number of edges in the kernel is also sharply 
concentrated at (b\ — b2)n. 

In examining the stripping process, it is difficult to keep track of the distribution of lengths 
of those 2-paths containing vertices of degree 2 that still remain but were adjacent to removed 
vertices. So we define another stripping process, called severe stripping, that in general removes 
more than is necessary, always erring on the safe side. This can be applied to a graph or 
multigraph Go with 6 (Go) > 2 and with no isolated cycles. To guide the process, some edges 
and vertices of the kernel are designated as red. Colouring an edge or vertex of the kernel of a 
graph red marks the corresponding part of the graph for removal during the severe stripping 
process. All edges incident with red vertices are also painted red. 

To initialise this process, begin with any graph G and obtain Go from the 2-core of G by 
deleting the isolated cycles. For any vertex v of Go that is adjacent to at least N — 1 vertices 
of G — Go, if dc (v) = 2 then the edge of the kernel K(Gq) corresponding to the maximal 
2-path containing v is painted red, whilst if dc (v) > 3 then it is a vertex of K(Gq) and is 
painted red, as are all the edges of K(Gq) incident to v. Also, with N as above, take note 
of all maximal 2-paths of Go containing more than N/2 vertices: the corresponding edges of 
K(Gq) are coloured red too. For later reference, we call Go the trimmed core of G, and the 
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graph K(Gq) together with its colouring defined in this way is called the painted kernel of Go 
with respect to G. 

During the process, some other kernel edges will be coloured purple, and of course the kernel 
will be modified as the graph changes. Purple edges correspond to 2-paths whose lengths have 
been "exposed" in the sense that they have been used to influence the algorithm, and yet 
which are not (yet) required to be removed. Also, some kernel vertices will be coloured pink, 
to signify that they have already lost a neighbour during the process. Any pink vertex that 
loses a second neighbour is immediately recoloured red. This is to ensure, without excessive 
bookkeeping, that, by the end of the process, all remaining vertices have lost at most N 
neighbours each. (We may assume N > 2.) The looseness this causes in the final bound only 
affects terms that we are not attempting to maximise. 

To simplify the argument further, we will also avoid keeping a record of the length of 
a purple edge at any steps after it is first formed. Hence, we must remove a purple edge 
whenever it is merged to another edge, i.e. when a vertex v adjacent to one of its ends drops 
to degree 2. 

Formally, severe stripping defines a sequence Go, Gi, . . . of graphs and colourings of their 
kernels as follows, beginning with the trimmed core Go and its painted kernel with respect to 
G. 

For step i, select a red edge of K{Gi-\) uniformly at random from all the red edges. 
Remove the corresponding maximal 2-path 70 of Gj_i, to obtain the next graph Gj. Let u 
be the vertex of Gj adjacent in Gj_i to one end of 70. If u still has degree at least 3, colour 
it pink if it was uncoloured in K(Gi-i), whilst if u was already pink, colour u and all edges 
of K(Gi) incident with u red. On the other hand, if u has degree 2 in Gj, let e be the edge 
of K(Gi) corresponding to the maximal 2-path 71 of Gj containing u. Colour e red if either 
of the two edges of if(Gj_i) which form u was already purple or red, and colour e purple 
otherwise. All other edges and vertices of K(Gi) inherit their colours from K(Gi-\). All red 
vertices will be deleted eventually — when their degree drops to two they will be deleted from 
the kernel and will remain as part of the path of the graph Gj corresponding to a red edge of 
the kernel, and when that edge is deleted the vertex will be removed from Gj (in some special 
cases below, both deletion steps will happen at the same time). 

There is a special case: if u is contained in an isolated cycle of Gj, then the whole cycle is 
removed. Treat the vertex v! adjacent to the other end of 70 separately with the same rules. 
If u = u' the rules above apply in the obvious way, unless the degree of u falls to 1 when 70 
is removed. In this case, remove the maximal path of degree 1 and 2 vertices that contains u 
from Gj_i, let u" be its point of attachment and repeat the above colouring rules treating u" 
as u. 

The severe stripping process continues, repeating the above step, until the point is reached 
that no red edges remain in K{Gm)- The N -reduced core of a graph G, denoted i?jy(G), is 
the final graph Gm obtained by applying the process starting with the trimmed core and its 
painted kernel with respect to G. This may seem to depend on the order of choosing the 
red edges for removal, but it is actually unique, which is convenient for descriptive purposes 
but unimportant for our arguments. The uniqueness can be seen, by observing that severe 
stripping is equivalent to recursively removing all red edges and vertices, any vertex which 
has had at least two incident edges removed, and any edge of the kernel that at any point has 
more than N degree 2 vertices or comes from merging at least three of the edges of K{Gq). 

One aspect of the definition of severe stripping may seem redundant at this point: one 
could avoid painting vertices red if one instead painted all incident edges red. The resulting 
process would be the same, but in the analysis we need to know which vertices of the random 
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graph have been investigated in some sense, so for this reason the red vertices are recorded. 
Using the severe stripping process we will obtain the following. 

Theorem 4.2 Fix c > 1 and let G G Q{n,m) where m ~ cn/2. For N sufficiently large 
(depending on c) and a sufficiently small, Rn(G) is a.a.s. an a-strong core of G. 

The proof of this theorem is spread out over the next two sections. 

For simplicity we redefine n and m so that G £ Q(n, m), and we condition on the numbers 
n of vertices and m of edges in the 2-core of G. The 2-core is distributed u.a.r. (uniformly 
at random) as a graph with these parameters n and m, and minimum degree at least 2. The 
parameters are sharply concentrated as discussed above, so we examine for a while the random 
graph space Q2( n , m ) containing all (n, m) graphs with minimum degree at least 2. 

To study this, and in particular the severe stripping process, we take the approach in |12j . 
which reveals that the following model, used by Bollobas and Frieze jlUj and Chvatal |14j . 
is very convenient for such purposes. (A similar idea was used in pj.) A random element of 
this model, which we call C(n,m), is obtained as follows. Start with n isolated vertices and 
add m edges by choosing each end of each edge uniformly at random. All choices are made 
independently with replacement. This is equivalent to choosing m labelled oriented edges and 
then forgetting their labels and orientations. The result is a pseudograph that can have loops 
and multiple edges. Note that the restriction to simple graphs gives precisely Q(n,m) (the 
uniform space). 

Now define the probability space Cfc(n,m) to be the restriction of C(n,m) to the graphs 
with minimum degree at least k. We will proceed to analyse severe stripping applied to 
C2(n, m), for appropriate values of m. 

From |12j for example, we know the following. 

Lemma 4.3 // we repeatedly delete vertices of degree and 1 from C(n,m), the final result, 
conditional on its numbers n' andm! of vertices and edges, is distributed precisely asC2(n',m'). 

The proof is simple enough to be omitted, using induction on the steps of the deletion process 
(see below for more complicated applications of this technique). With a similar step- by-step 
approach, we easily obtain the following, which bears some resemblance to |.'~S()I Lemma 3]. By 
suppressing a vertex v of degree 2, we mean joining its two neighbours with a new edge and 
then deleting v. 

As Luczak |27j observed, we know that a.a.s. there will only be a small number of vertices 
in isolated cycles in C2(n, m), which will affect the parameters after those cycles are discarded. 
To avoid switching notation after deleting a small number of vertices, we proceed initially as 
if no isolated cycles occurred. Let Ciin, m)* be the probability space derived from C2(n, m) by 
restricting to those pseudographs with no isolated cycles. Note that for the following lemma 
and similar statements, assuming that n-vertex graphs have vertex set [n] = {1, . . . ,n}, we 
should map the n 1 vertices of the kernel into the set [n'\ in a canonical way. Each time a vertex 
of degree 2 is deleted, the remaining vertices may be renumbered, preserving the ordering. 
This renumbering is sometimes done implicitly in our arguments. 

Lemma 4.4 If we begin with a random member M of C2(n,m)* and suppress vertices of 
degree 2 repeatedly until none remain, the result, conditional on its numbers n' and m' of 
vertices and edges, is distributed precisely as C^{n' ,m'). 

Proof For this proof, we may retain the labels and orientations of the edges of M as in the 
definition of the model. Then, conditional on the set of degree 2 vertices in M, it is uniformly 
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distributed. To suppress a degree 2 vertex, choose the vertex v to be suppressed and the end of 
an incident edge x to delete. The other edge incident with v is extended to meet the vertex at 
the other end of x, while both x and v are deleted, to obtain a pseudograph M' (with oriented, 
labelled edges). It is clear that the number of ways to reverse this operation is independent of 
M', given the labels of v and x (which must be missing in M'). By induction, the pseudograph 
obtained after suppressing k of the degree 2 vertices in this manner is uniformly distributed, 
given its set of vertices, edges, and degree 2 vertices. The lemma follows from this statement 
applied to k being the number of vertices of degree 2 in M. I 

From Lemma 14.41 the kernel K(Gq) can be modelled by C^{n' ,m'), where (n',m') will 
be restricted to the range of the sharp concentration shown above from known results about 
simple graphs. Our conclusions that are a.a.s. true for kernels with this range of values will 
then be shown to apply to the case that the initial graph was simple. 

We call it a random ordered assignment of a given set of vertices to a given set of edges of a 
pseudograph if the given vertices are randomly assigned to those edges and the ones assigned 
to a particular edge placed along it in some order, such that, with parallel edges canonically 
distinguished from each other and loops given a canonical direction, each assignment (including 
the ordering along each edge) is equally likely. 

Lemma 4.5 The elements of Ci{n,rn)* having a kernel with vertex set [n'] and with m! 
edges are obtained with uniform distribution by starting with the kernel randomly taken from 
Cz{n' ,m'), and then using a random ordered assignment g of the vertices of degree 2 to the 
edges of the kernel. 

The proof is omitted as it is very similar to the proof of Lemma 14.41 The version for simple 
graphs was used by Luczak [27j and in |3U1 Section 4] where kernels are examined conditioned 
on degree sequence. 

Lemma 14.51 shows that to analyse the severe stripping algorithm applied to C2(n,m)*, we 
may consider a random kernel taken from C%{p! ,m') and a random ordered assignment g of a 
given set So of degree 2 vertices (for all the appropriate values of the parameters) . We need 
to use a version of the method of deferred decisions: we do not examine the end of any purple 
or red edge until it is needed for a decision in the severe stripping algorithm. For a precise 
description, we argue as in [T2] but with a model similar to the kernel configuration model 
of [313 • Model Cs(n',m') as the set of random functions / from [2m'] to [n'\. The pair of 
vertices f(2j — l)f(2j) forms an edge for 1 < j < m' . Thus, the function g maps the set 
of degree 2 vertices (of which there are m — m') to {j : 1 < j < m'}, where the number j 
represents the edge f(2j — l)/(2j). We say that this edge has label j. 

The presence of a painted kernel affects the process, so we have to define some sets for 
special attention. At the end of the zth step of the stripping process, let Sj C S denote the set 
of degree 2 vertices on noncoloured edges (i.e. non-red non-purple edges) of G{. Let Ri denote 
the set of all j such that f(2j — l)/(2j) is a red edge of K(Gi), and Pi the corresponding set 
for purple edges. Also VRi and VRi are the sets of red and pink vertices respectively. As with 
the vertex labels, at each step that an edge is deleted, the edge labels are compressed into the 
range [m'] (where m! is the number of edges of the kernel of the new graph) and the action 
of / and g is modified accordingly. When two edges of the kernel coalesce into one (due to 
a common adjacent vertex being reduced to degree 2), a similar canonical relabelling of the 
edges is carried out in which the new edge is given some canonical label, say the largest edge 
label. 

In the end we will show that we only need to deal with a stripping process with the starting 
graph Gq drawn uniformly at random from Ciin, m)* . We can generate a random element 
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of this model while performing the stripping algorithm, "exposing" only those parts of the 
graph as required for steps of the algorithm. After step % the exposed parts are the labels of 
the purple and red edges of K(Gi), the degree 2 vertices on each of these (i.e. that part of the 
ordered assignment g), and all the values f(i) contained in VRi. Thus, initially the labels of 
the red vertices and their preimage under /, and the members of So assigned to red edges, 
are all given. When the vertex u at the end of a red edge with label j is investigated, the 
value f(2j — 1) or f{2j), as the case may be, is first chosen from the non-red (i.e. uncoloured 
and pink) vertices, and then it is decided (randomly, with the correct probability, which the 
following lemma gives a simple way to calculate) if u has degree 3. If u has degree greater 
than 3, it is coloured pink but the remaining part of is kept random, i.e. not exposed. 

If degree 3, the two adjacent edges are determined (i.e. the two remaining elements of / -1 (tx) 
are decided), as is the part of g assigning vertices to these edges. Provided that these two 
edges are distinct, they coalesce into one new purple or red edge (depending on the sizes of 
those preimages). For this new edge, only its label ml is known in this step, and not the 
endvertices of the edge. On the other hand, if the two elements of / _1 (it) belong to the same 
edge, it is simply removed, because it corresponds to the appearance of an isolated cycle in 
the stripping algorithm. 

The following lemma asserts that the unexposed part of the graph remains nicely random, 
in order for the whole "exposing" process to work as described. First, note that we defined 
the trimmed core and the painted kernel for multigraphs and hence they apply to members 
of C(n, m). 

Lemma 4.6 Let Gq be the trimmed core of a random multigraph G in C(n,m), and colour 
K(Gq) as the painted kernel of Go with respect to G. Then apply the severe stripping process 
to obtain Gi,G2, ■ ■ ■■ Next, condition on \V{K(Gi))\ = n' and \E{K{Gi))\ = m! , on the sets 
Ri, Pi, VRi and VP. L , and on / _1 (j) for all j £ VRi. Then the remaining values of f are 
distributed uniformly at random on [n] \ VRi conditional upon |/ _1 (i)| > 3 for all j. 

Proof A key thing to realise for this proof is that the conditioning described does not put 
any constraints on the relative positions of the coloured edges and vertices. 

The lemma is proved by induction on i . This is similar to the proof of Lemma 5 in ^21 > only 
more complex because various cases of encountering red or purple edges need to be considered. 
The case i = follows from Lemmas 14.31 and 14.51 since the number of ways to reinstate the 
isolated cycles is independent of the values of /. 

We will now show how step i + 1 follows from step i. Let R4+1, Pi+i, VRi + \ and VPi + \ 
be given, and let f\ and fi be two functions from [2m'] to [n'j such that \fj7 > 3 that 
satisfy the compatibility condition that fk(j) £ VRi + \ implies that fz-k{j) = fk(j)- We need 
to show that both functions fk have the same probability. The function fk could come from 
some gk in the ith step by various means. Let us take as an example the case where a red 
edge connected to two uncoloured vertices of degree at least 4 is removed (and the vertices 
are hence coloured pink). To reverse this process, one must find two pink vertices p\ and P2, 
uncolour them, add an edge between them at some position s £ [m'], and relabel the edges. 
Examine some specific p±, P2 and s. It is easy to see that they dictate the sets Ri, Pi, VRi 
and VPi and further that the two functions g\, 52 satisfy that \gT > 3 as well as the same 
compatibility condition the functions fk satisfy. Hence (by induction) they have the same 
probability. Further, since the red edge to be removed is selected randomly, the probability 
of s to be selected for removal for g\ is the same as for g<i- Hence they contribute the same 
amount to the functions Since this holds for any values of p\, P2 and s, we get that the 
total contribution of our example case (a red edge connected to two uncoloured vertices of 
degree > 4) is the same to f\ and to fi- 
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As a second example, let us take the case that one end is uncoloured with degree 3, and 
its other two edges are also uncoloured. The stripping rules require us to suppress the vertex 
and colour the new edge purple. Hence the reversal process consists of finding a pink vertex p, 
verifying that the highest labelled edge q is purple, finding a location for the removed vertex 
t, three locations for new edges, s\ for the red edge removed and S2, S3 for the uncoloured 
edges merged. As above we see that the reversal process does not depend on the "remaining 
values of /" and the argument goes through unchanged. We will not bore the reader with any 
more cases. I 

This lemma is used in the proof of the next result. We will use e^v to denote some function 
that is constant for fixed ./V but goes to as N — > 00 (perhaps different functions at different 
occurrences of the notation). 

Lemma 4.7 Let m ~ cn/2 with c > 1. Let Go be the trimmed core of G G C(n,m), and 
let n and rh be the (random) numbers of vertices and edges of K(Gq). The numbers n' and 
m' of vertices and edges of the kernel K(Rn(G)) of the N -reduced core of G a.a.s. satisfy 
n' = n — 0{e^n) and m! = rh — O(eArn). Moreover, conditional upon having particular values 
of n' and m! , K(Rn(G)) is distributed as C^{n' ,m'). 

Proof The simplest part is the last, as it follows directly from lemma POfl Indeed, for any 
possible value H of K(Rjsr(G)), its probability comes from a sum over all its realizations 
as a function / : [2m'] — * [n'j, all possibilities for the number of trimming steps i and all 
possibilities for Pj and VPi. However, the number of possibilities does not depend on the 
structure of H at all, and the probability of each quadruple /, i,P«, VPi does not depend on 
/ by lemma l4~6l This shows that K(Rn(G)), conditioned on m! and n' is indeed distributed 
as Cs(n', m'). 

As noted in Section EI conditioning on G G C(n,m) being simple is equivalent to taking 
G G G(n,m). For such G, by Theorem 13.11 and Lemma 14.11 we have h ~ a\n and rh ~ 02^1 
a.a.s., for some constants a\ and 02 with a\ < a<i and depending only on c. The same 
concentration then holds also for G G C(n,m) by a quite simple argument: an alternative 
way to generate G G C(n,m) is to first decide how many loops, £, and multiple edges, j, it 
has (and their multiplicities mi, . . . ,mj) with the correct probability, generate an underlying 
simple graph G* at random, and then adorn G* with t loops at random locations, and the 
required extra copies of j of its edges. The distribution of G* should be uniform with n vertices 
and m — £ — Yli( m i ~ 1) edges, and the locations of the loops and multiple edges are chosen at 
random. Simple calculations with Markov's inequality show that £ + Y2 m i = O(logn) a.a.s. 
Furthermore, by Lemma 1531 adding a loop or giving an edge of G* extra parallel copies, a.a.s. 
will not increase the size of its 2-core by more than O(logn) (and of course cannot decrease 
it). Adding a loop or an extra copy of an edge can only increase the kernel size by 2 vertices 
or 3 edges (the extreme case is that of adding an edge parallel to an edge in the middle of a 
path of vertices of degree 2). It follows that the concentration in Theorem 13 . II and Lemma 14. II 
also applies for G G C(n,m), in particular n ~ a\n and m ~ a^n a.a.s. 

From the above paragraph, the numbers of vertices and edges of the 2-core of G are a.a.s. 
6(1 — t)n + o(n) and b\n + o(n) respectively, and the number of degree 2 vertices is a.a.s. 
b2n + o(n). Recall that b\ > 6(1 — t) > 62. Deleting isolated cycles to obtain Go as the 
trimmed core of G will maintain uniform randomness, provided its numbers of vertices and 
edges are conditioned upon (and conditional upon having no isolated cycles). As observed 
by Luczak j^T], the number of vertices in isolated cycles of the 2-core is small; it is easy to 
show that it is bounded in probability, or a.a.s. O(logn) for example. So these do not affect 
our argument and we ignore them. The argument above shows that Lemma 13.31 applies also 
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to G £ C(n,m). It follows that the number of vertices of Go that are adjacent to at least 
N — 1 vertices of G — Go is a.a.s. at most e^vn. This is therefore a bound on the number of 
red vertices in the painted kernel of Go with respect to G, and on the number of edges that 
are coloured red because they correspond to a maximal 2-path of Go containing such a vertex. 
Note that, given Go and the number r of red vertices, each r-set of vertices of Go is equally 
likely to be the set of red ones. 

Similarly, since 6(1 — t) > 62, the average number of vertices of Go assigned to an edge 
of its kernel is bounded, and, using Lemma 14.51 has an exponential tail. It follows that the 
number of edges of K(Gq) that are painted red at the start of the stripping process is also at 
most e^n. 

At the start of the stripping process, the kernel has n vertices. From Lemma 13.31 and 
similar elementary analysis, the proportion of its edges and vertices that are red is at most 
€n. We need to consider how many red edges or vertices are produced during the step in 
which Gj+i is obtained. Purple edges and pink vertices merely denote "potential trouble" and 
will remain at the end, and we need their number to remain small. To aid in this, we define 
for fixed e' > the stopping time T(e') to be the the smallest value T such that at least one 
of the following holds: 

• T > e'h, 

• K(Gt) has no red edges and no red vertices, 

• K(Gt) has more than e'n coloured edges and vertices. 

Here T is a stopping time for the exposition process, formally with respect to the c-fields 
generated by Ri, Pi etc. We will examine the behaviour of the process up to the stopping 
time T(e'), for e' and N fixed, and note the behaviour of the conclusions we draw, as N — > 00. 

We use Lemma 14. HI frequently. Note first that each stripping step deletes an absolutely 
bounded number of vertices and edges from the kernel. Hence by the definition of T(e'), for 
% < T(e'), there are at least n/2 uncoloured edges in K{Gi) (for e' sufficiently small). Also, 
the number of new pink vertices or purple edges each increase by at most 2 in each stripping 
step. So K{Gi) has at most 4e'n such elements for i < T(e'). 

We must also examine the distribution of numbers of red edges in K(Gi) for i < T(e'). In 
Step i + 1, first assume that the vertex u adjacent to the end of the edge 70 to be removed 
is uncoloured and of degree 3. The probability that either edge incident with u was already 
purple is, using Lemma 14.61 and the bound on the number of purple edges, at most O(e') 
(where the implicit constant in this bound is independent of N). So O(e') is an upper bound 
on the probability that a new red edge is created. On the other hand, the probability that 
u is already pink is O(e') by Lemma 14.61 and the conclusions in the above paragraph; then it 
becomes red and all incident edges become red. The distribution of the number of such edges 
is asymptotically truncated Poisson (the distribution is actually multinomial conditioned on 
|/ _1 (i)| > 3 for all j). This has an exponential tail. 

We conclude that there is an upper bound O(e') on the expected number of new red edges 
arising in every step of the process, with an exponential tail, up until time T(e'). This is, 
at each step, conditional upon the state of the process in the previous step. Note that the 
increase in the number of red edges in one step is 0(log 2 n') with probability at least 1 — o(n~ 2 ). 
A standard supermartingale inequality now shows that a.a.s. the total number of red edges 
created up to time T(e') is 0(T(e')e') (see (S31 Corollary 4.1]; the last paragraph of the proof 
of Theorem 5.1 in that reference explains how to handle the fact that the expected change in 
the number of red edges is not bounded by a constant). As T(e') < e'h + 1 by definition, there 
are a.a.s. at most 0((e') 2 h) new red edges. Moreover, since every step of the process uses up 
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a red edge (of which there are initially at most e^h) this implies that T(e') = 0{e^ + (e') 2 )n. 
Now take a concrete instance g(N) of this function ejv, an d let e' = e'(N) = \J 'g(N). Then 
T(e') = o(e'n) as N — » oo. Hence, a.a.s. in the definition of T(e') it is the condition that there 
are no red edges nor vertices that is the binding one. That is, a.a.s. the whole process lasts for 
at most e'(N)n steps, where e'(iV) — > as n — > oo. Since Rn(G) = Gm, the lemma follows. 
I 

We denote by G — E(H) the spanning subgraph of G with edge set E(G) \ E(H). Note 
that no vertex of Rn(G) has more than one edge to Go — Rn(G), since otherwise all its edges 
would be painted red and so the vertex must be deleted eventually. Similarly, when Go is the 
2-core of a graph G, the edges incident with vertices of Go that are adjacent to at least N — 1 
vertices of G — Gq are initially made red. There are 0(eNn) of these, and these vertices cannot 
survive in Gm- It follows that no vertex of Rn(G) has more than N edges to G — Rn(G). 

To prove Theorem 14.21 we also need to check the condition on sizes of components. We 
first need a preparatory lemma on a sort of coalescing branching process. This is a simplified 
version that does not apply directly to the actual process we need to consider, but will, with 
appropriate choice of Z, provide a useful comparison via stochastic domination. 

Lemma 4.8 Fix < e < 1/2 and a nonnegative random variable Z with E2 < e and with 

exponential tail. Suppose that a graph F n with n vertices has component sizes with exponential 

tail. Suppose furthermore that at most 5n vertices are marked, and that these include all the 

vertices in nontrivial components. Now process the marked vertices successively by adding 

edges to a random set of neighbours. The random number of neighbours chosen is distributed 

according to Z , and, given the number of neighbours, the neighbours themselves are chosen 

uniformly at random from all vertices. (These choices are done independently at random 

for each vertex processed. For simplicity, we permit loops, so a vertex may choose itself.) 

Each vertex processed becomes unmarked, and each unmarked isolated vertex that is joined to 

( f) 

becomes marked. The process finishes with a final graph, Fn when all vertices are unmarked. 

( f) 

Then, for 5 sufficiently small, a.a.s. the component sizes in Fn have an exponential tail, and 
a.a.s. there are at most en vertices in nontrivial components. 

Here 5 only has to be smaller than some absolute constant c. 

Proof Let us perform an equivalent process, in two stages: first growth, and secondly identifi- 
cation and pruning. Let S denote the set of marked vertices. In the first stage, for each vertex 
v £ S, perform a Galton- Watson branching process with birth law given by the distribution 
of Z, and originating with the single individual v. All the children in all these processes are 
at this point represented as separate vertices in a set T, where S n T = 0, and the branching 
processes are represented as trees. 

In the second stage, perform random identifications of vertices in the trees generated in 
the first stage: each vertex u in S is taken in turn, and for each such u, each child vertex 
w in its branching process is taken in turn (working up the tree away from u). Then, with 
probability k/n, the vertex w is identified with one of the vertices previously processed in this 
second stage, where k is the number of such previous vertices. If identification occurs, the 
vertex to identify with is picked at random. Furthermore, all vertices in the branches of the 
tree above w are deleted. 

If we now add all edges present in F n , it is clear that we obtain a graph with the same 
(f) 

distribution as Fn . 

( f ) 

To bound the component sizes of F n , we analyse the equivalent process without the 
deletion steps. Start with a vertex randomly chosen in S. The size of its component in F n has 
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exponential tail (in the probabilistic sense) and has expected size 1 + 0(e). Each vertex in 
its component is in S, and we may consider the tree of each one separately. The size of each 
of these trees has an exponential tail, and expected size 1 + 0(e). Each vertex in the tree is 
identified with a number of vertices processed earlier or later, and the number of these has an 
exponential tail with expected size 0(5). So we may consider a new branching process, the 
children of a vertex being the new vertices in any tree reached by identification. The number 
of children then has an exponential tail, and hence, using j311 Theorem 3.3], so does the size of 
the new branching process. Furthermore, it is easy to see that its expected size is l + 0(e + 5). 

Because no truncations occur, the sizes of the new branching processes are independent 
and identically distributed. The rest of the proof is straightforward (c.f. Lemma l3.3j) . I 

To apply this lemma, the initially marked vertices are the initially red edges. New marked 
vertices are new red edges. 

Lemma 4.9 Let G be as in Lemma \4- 1\ For N sufficiently large, the set of numbers of edges 
in the components of G — E(R^(G)) a.a.s. has an exponential tail. 

Proof Define Go as in the proof of Lemma 14.71 It was shown there that the proportion of 
edges and vertices of the painted kernel that are red is at most e^v. 

The number of new red edges generated in any one step, as observed above, has an ex- 
ponential tail with a truncated Poisson approximation. We need to consider the components 
Ci,...,Cj induced by all the edges of the kernel K(G$) that are red or are subsequently 
painted red. We may begin with the initially red edges and vertices, all considered as marked 
vertices in some graph F q as in Lemma [4.81 These are arranged in components of F q according 
to the respective components in the subgraph of K(Gq) that they induce. These component 
sizes are easily seen to have an exponential tail. This follows because of two things: firstly, as 
noted in the proof of Lemma l4.71 the red vertices occur as a set chosen uniformly at random, 
and similarly the red edges that are not adjacent to red vertices. The second ingredient is 
that the distribution of degrees of the red vertices will be determined by the distribution of 
degrees of the vertices in Go, which are, by the results in ^21 > multinomial and hence in the 
limit Poisson. We omit some details, as this part of the proof is straightforward. 

The other vertices and edges of K(Gq) are all vertices of F q . We may assume as above 
that the number of red edges is always at most e^n. As each red edge is processed, its vertex 
in F q joins with at most probability ev to one or two other vertices of F q . It is thus seen that 
the sizes of components in the resulting graph are bounded above by those of an associated 
process of the type analysed in Lemma l4.8[ and the variable Z has E Z < ejv < L 

From Lemma 14.81 we deduce that the sizes of C±,. . . ,Cj a.a.s. have an exponential tail. 
From Lemma 14.51 it is easy to see that the set of numbers of degree 2 vertices in the maximal 
2-paths of the 2-core of G a.a.s. have an exponential tail (and this applies equally well if the 
isolated cycles are regarded as maximal 2-paths). Finally, Lemma 13 . 31 savs that the set of sizes 
trees attached to each vertex of the 2-core of G has an exponential tail a.a.s. Recall that these 
are attached randomly. Combining these statements using |311 Theorem 3.3] gives the result. 
I 

As we shall show formally at the end of the next section, all that remains to prove Theo- 
rem is t° verify that the iV-reduced core a.a.s. satisfies the required expansion property. 
This is considered in the next section. 
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5 Expansion of the kernel 



Let Q{d) denote the uniform probability space of the graphs with degree sequence d = 
(rfi, . . . ,d n ). To model this probability space we may use the pairing model (see [HH)- Here 
there are cells labelled 1, ... ,n with di points in the ith cell. A uniformly random pairing 
of all the points is selected, denoted V{d). Regarding the cells as vertices, this produces a 
random (pseudo-)graph G on n vertices. It is easy to check that the following holds (see ^2] 
for example). 

Lemma 5.1 The distribution of G arising from V(d) is exactly the same as that obtained by 
restricting C(n,m) to graphs with degree sequence d. 

To make the connexion with the results in Section |I] we still need the following. 

Lemma 5.2 For m = 0(n), there exists C > such that G £ C(n,m) is simple with proba- 
bility at least C . 

Proof It is straightforward to show using the method of moments that the numbers of 
loops and pairs of parallel edges are asymptotically independent Poisson in distribution. The 
expected numbers are m/n and (™) /n 2 respectively, which are both O(l). The result follows. 



So we may focus on the pairing model, and Lemma f4.7l tells us that we only need to consider 
a set of degree sequences that a.a.s. contains the degree sequence of £3(71,771), for m = 0(n). 
The convergence expressed in the following lemma is uniform over all degree sequences d in 
the stated range. 

Lemma 5.3 For some a > 0, the random multigraph Gd arising from the pairing model V(d) 
with 3 < mindj < maxdj < nP' 02 is a.a.s. an a-expander. 

Proof First, note that $ is the minimum of e(S)/d(S) over all sets S of vertices whose sum 
of degrees is at most the number of edges of the graph, where d(S) is the sum of degrees of 
vertices in S and e(S) is the number of edges leading out of S. 

In the proof of |27l Lemma 12.6], Luczak shows that, conditional on a given degree sequence 
d with minimum 3 and maximum at most n 0,02 , the multigraph arising in the random pairing 
V(d) a.a.s. has no subgraph on r vertices, 2 < r < n 0A , with more than 1.2r edges. It follows 
that each set S of at most n 0A vertices has e(S)/d(S) > 1/5. Much simpler calculations show 
that the same is true for r = 1: the expected number of pairs of loops with the same vertex 
is o(l). 

For the sets of vertices between n 0A and n/2, Luczak only establishes a constant lower 
bound on the number of edges leaving the set. Here we more than fill the gap by computing 
the expected number of sets of vertices S with d(S) = q, where n 0,2 < q < m (m = \E(Gs)\) 
and e(S) = t. Note that the upper bound maxdj < n 0,02 is not required in this part. 

Assume \S\ = s; we will sum over all relevant s later. The expected number of sets as 
above is 



where the summation is over all subsets S of V with |5| = s and d(S) = q, and P(m,t,q) is 
the probability that a random matching of 2tti points has exactly t edges leaving a given set 
of q points. Thus 



I 




(5.1) 



P(m, t, q) = [ q ) ( q ) t\M{q - t)M(2m -q- t)M(2m) 




-1 
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where M(k) = ^ k ^l 2 k / 2 * s ^ ne number of perfect matchings of k points (k even). 



Using Stirling's formula and separating out insignificant factors of size we obtain 

P(m,t,q) < m C 



l-\ g \2m~q l q\tt q \q-t 

^0(1) \ L 2m) \t> \2m> 



f£*)(s-t)/2fl - 3+t)rn-(q+t)/2 
v 2m I \ 2m I 

< m°« ((0(l/e)) e {r l -%l - r)) 1/2 ) 

using q < m and with t = eq and r = q/(2m). By taking e close to we can make e e close to 
1, and so the most significant part of this is 



(5.2) 



First consider the case that q/3 < n/2. Since s < q/3 by the fact that vertex degrees are 
all at least 3, we can use 

(s) " C/3) " ( 3en /^) ?/3 - ( 2em /l) q/3 = (e/r) 9 / 3 . 
Multiplying by ()5.2|) . we maximise r l/6(l - r )V2 a t r = 1/4 and find that (JHHJ) is at most 

m° {l \l-e') q (5.3) 

for some e' > when e < eo (some eo sufficiently small). On the other hand, if q/3 > n/2, 
use (") < 2 n < 2 2g / 3 and since r 1//2 (l — r) 1 / 2 < 1/2 the same conclusion is reached. Then 
summing (|5.3|) over all n 2 < g < m, all relevant s and all t < eoQ, the result is o(l). Hence 
the expected number of sets of vertices in the size range being considered (n 0,2 < s < n/2) 
with e(S)/d(S) = t/q < e is o(l). 

We conclude that a.a.s. $ > min(l/5, eo). I 
Proof of Theorem 14.21 For m as in the theorem statement, let G G C(n,m). Then, by 
Lemma 14.71 for some constant c' > the number n' of vertices in K(Rn(G)) is a.a.s. at least 
c'n for N sufficiently large. Moreover this graph is distributed as £3(7/, m'), given n' and 
its number of vertices to'. Then by Lemma 15.11 further restricting this to degree sequence 
V(d) gives graphs with the distribution of V(d). It is well known and easy to verify that, 
since to' = O(n'), a.a.s. the maximum degree occurring in Cz(n' ,m') is o(n ' 02 ). Hence by 
Lemma 15.31 K{Rn{G)) is a.a.s. an a'-expander (with a' being the a from that lemma). Thus 
Rn{G) is a.a.s. an a-expander, where a = a'/N say, since it is obtained from its kernel by 
inserting at most N vertices of degree 2 into each edge. It a.a.s. satisfies property (2) in the 
definition o! f an a-strong core of G by Lemma 14.91 It satisfies property (3) for a < 1/(2N) 
by the definition of the severe stripping process, since vertices of the 2-core that are adjacent 
to more than N vertices outside it are deleted, and during the stripping, any vertices adjacent 
to at least two that have been deleted during stripping are deleted themselves. Thus for 
a sufficiently small, Rn(G) is a.a.s. an a-strong core of G. The theorem then follows by 
Lemma 15.21 which lets us translate results holding a.a.s. to Q(n, m). I 

Acknowledgment The authors wish to thank Elchanan Mossel for helpful discussions at an 
early stage of this research. 
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